Robust Standard Error Estimate for Cluster Sampling Data: A SAS/IML Macro Procedure for Logistic Regression with Huberization

نویسنده

  • Honghu Liu
چکیده

Data sets with cluster structure are very common in practical business and research, one has to take into account the intra-cluster correlation in data analysis. This paper systematically discusses the Huber/White standard error estimate for cluster sampling data in logistic regression, and presents a user-friendly SAS/IML macro procedure which can automatically fit logistic model, calculate robust standard errors and produce confidence intervals for odds ratio. The robust standard error calculated in this procedure also has a finite sample-adjustment feature which is now available in most updated version of some statistical software. The syntax for the procedure is simple and easy. One data example is shown to illustrate how to actually use the procedures. The SAS system products included in this work are SAS, SAS/STAT and SAS/IML . This procedure can be run on any IBM compatible personal computer, MVS/UNIX system and any other computer platforms with a working SAS system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A SAS/IML Macro for Goodness-of-Fit Testing in Logistic Regression Models with Sparse Data

The logistic regression model has become the standard analyzing tool for binary responses in a variety of disciplines. Methods for assessing goodness-of-fit, however, are less developed and this is especially pronounced in calculating goodness-of-fit tests with sparse data, when the standard tests (deviance and Pearson test) behave unsatisfactorily. In our paper we show two solutions to the pro...

متن کامل

SAS Macro for Generalized Method of Moments Estimation for Longitudinal Data with Time-Dependent Covariates

Longitudinal data with time-dependent covariates is not readily analyzed as there are inherent, complex correlations due to the repeated measurements on the sampling unit and the feedback process between the covariates in one time period and the response in another. A generalized method of moments (GMM) logistic regression model (Lalonde, Wilson, and Yin 2014) is one method to analyze such corr...

متن کامل

LOGITSE: A SAS Macro for Logistic Regression Modeling in Complex Surveys

Traditional formulae for standard errors and subsequent statistical significance tests implemented in various popular statistical packages are based on the premise that the data are a simple random sample (SRS) of observations from a superpopulation. Equivalently, the observations are assumed to be independent and identically distributed (IID). For complex analytic surveys, these assumptions ar...

متن کامل

SUGI 28: Estimating Standard Errors for CLASS Variables in Generalized Linear Models Using PROC IML

Analysts who work frequently with linear models using the SAS System often use the LSMEANS statement in PROC GLM or PROC MIXED to obtain “adjusted means.” The absence of a clear analogue to these least-squares means for nonlinear or generalized linear models can be an obstacle for both the analyst and the eventual audience of the analysis. This paper shows how to use PROC IML to estimate standa...

متن کامل

A SAS/IML software program for GEE and regression diagnostics

A SAS/IML software program is described that computes regression diagnostics for generalized estimating equations. These diagnostics are computationally efficient and accurate approximations for the effect of deleting one observation or one cluster on individual regression coefficients (DFBETA) or on the overall fit of the model (Cook’s Distance). New formulae for the diagnostics are presented ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998